Combinatorial Spaces And Order Topologies 

Philon Nguyen 



Abstract. An archetypal problem discussed in computer science is the prob- 
lem of searching for a given number in a given set of numbers. Other than 
sequential search, the classic solution is to sort the list of numbers and then ap- 
ply binary search. The binary search problem has a complexity of 0(logN) for 
a list of N numbers while the sorting problem cannot be better than O(N) on 
any sequential computer following the usual assumptions. Whenever the prob- 
lem of deciding partial order can be done in O(l), a variation of the problem 
on some bounded list of numbers is to apply binary search without resorting 
to sort. The overall complexity of the problem is then 0(logR) for some radius 
Ft. The following upper-bound for finite encodings is shown: 

OpoglXU) 

Also, the topology of orderings can provide efficient algorithms for search prob- 
lems in combinatorial spaces. The main characteristic of those spaces is that 
they have typical space complexities of 0(2 ), O(Nl) and 0(N N ). The facto- 
rial case describes an order topology that can be illustrated using the combi- 
natorial polytope . When a known order topology can be combined to a given 
formulation of a search problem, the resulting search problem has a polylog- 
arithmic complexity. This logarithmic complexity can then become useful in 
combinatorial search by providing a logarithmic break-down. These algorithms 
can be termed as the class of search algorithms that do not require read and 
are equivalent to the class of logarithmically recursive functions. 



1. Computable Structures 

A few terminological remarks can be made on the algebra of computable struc- 
tures. 

Remark 1.1. A computable space is denned using the distinguishability prob- 
lem. 

Remark 1.2. A computable space is said to be complete if it can represent a 
Turing-complete space. 

Furthermore, the following can be noted: 

Remark 1.3. The equality operator is defined using the cut metric of Equation 
This is denoted 



3.0.11 



A 



Whenever J\f = 0, the conventional equality operator can be used. 
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Remark 1.4. Let T n be a sequence such that |T| = n. A computable sequence 
is such that the following holds: 

(1.0.1) /(T n ) = S 

for some computable function / and for a set S such that: 

(1.0.2) T™ ^ T™, S^S, 

and: 

(1.0.3) T? = T], Si = Sj 

Remark 1.5. Let / be a computable function, then / has a representation in 
a complete computable space. 

Definition 1.6. A domain F™ on a function / is termed a factorial domain if 
the following holds: 

(1-0.4) /(x 4 )^/(x J ), x^x, 

and: 

(1-0.5) /(x 4 ) = /(x J ), x i = x, 

2. Order Topologies 

An order can be best described as a subset of the integral line. Let X define a 
discrete space such that / : X — >• Y defines an orderable discrete space Y. Or(Y) 
then define the order of Y and a compact order is such that: 

(201) AOrOO 

Whenever Y is termed orderable, then the following holds: 
(2.0.2) Yi -Yj =k 

for some value k. The difference operator AY is then defined on a differentiable 



representation of X. Equation 2.0.1 defines the sequence of integers (n, n+1, . . . , n+ 
k) or, alternatively, a sequence of ordered incremental values (a , + . . . , a +n5) 
for some unit incremental operator S. This can be denoted Oi'i(X). Given an 
arbitrary order Or(Y), the following holds: 

(2.0.3) Ori(Y) =7r(Or(Y)) 

for some permutation tt. Whenever the permutation can be found seeminglessly, it 
can be said that Ori(Y) is equivalent to Or(Y). 

Example 2.1. The space of rank orders is given by the set of all permutations 
of the form 7r(l, 2, . . . , n) [§]. Figure [l] illustrates the Cartesian representation of 
an order over a prime factorial domain given by: 

(2.0.4) Gl(7r t (a 1 ,a 2 ,a 3 ))=pT< 1 p a 2> 2 p7' 3 

where the G-operator defines a g-code of length 3 on a set of cardinality 3. A 
candidate distance measure in Cartesian space which preserves Or o G3(7r(l, 2, 3)) 
is given by: 

(2.0.5) d(x,y) = ||wx-wy|| 
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Figure 1. Polytope of Or(7r(l,2,3)) in a prime factorial domain. 

for some given weight vector w and x,y£ 7r(l, 2, 3). It follows that: 

(2.0.6) Ori(7r(l, 2, 3)) = ^(Or o G|(tt(1, 2, 3))) 

for p = (2,3,5). Equivalently, an l-code representation can be given as: 

^ Q _ L^Tr^a!, a 2 ,a 3 )) = [a itl logpij log + 

|a»,2 logp 2 J log + [ai, 3 logP3j log 

where the log-floor operator keeps a logarithmic number of digits in regards to the 
largest value in the code. In practice, this can be computed using a digit-by-digit 
radix representation. 

Example 2.2. Let the following log-prime factorial domain on an index func- 
tion 1 = (1, . . . , 2^)2 be defined as: 

JV 

(2.0.8) L^(l t )=Y / a i [\ogp l \ log , a,e(0,l) 

It follows that the subsets of the Z-codes of Equation |5.0.3| are given by: 

l^(i(i w xi 1 ))=i 1 l;(i) 

(2.0.9) A 



log 



for kj £ (0, . . . , N) and some arbitrary sequence of ordered primes. An illustration 
of the case N = 4 is shown in Figure [2| 

Let ~X. N define the product space X x . . . x X such that C X. N . If there 
exists some ordering differentiable to one, then then the order is termed cyclical. 
For some given ^-code representation, we have: 

(2.0.10) A0r(,^ ( (LMX))) ) = 1 

given some reshaping permutation operator ~k% j k... 
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K=3 



K=2 



K=l,2,3 



Figure 2. Order curve. 



Example 2.3. The indicator function X = (1,2,..., 2^)2 has a (g, i)-code rep- 
resentation such that any point on in Or(L^(I)) can be computed using 0(N log 2) 
recursions, or alternatively, 0(1) when the order is known. 

Example 2.4. The 0(2KM{K + M)) Jackson-Sheridan-Tseitin transform (6| 
10 is given by: 



(2.0.11) 



S = 



(l£ ® la) 


T 


B 





(Im ® I2) 


T 



for some random matrix B such that b^j € (0, 1) 



3. Distance Equations 

Formally, the notions of distance spaces, norms and metrics are defined in the 
following definitions. 

Definition 3.1. A distance measure d:XxX— >Aisa non-invertible mapping 
that has the following properties: 

(3.0.1) d(xj,Xi) = 0, Xi GX 

(3.0.2) d(xj,Xj) = d(xj,Xj), x^x^ G X 

for some non-empty set X and an alphabet A isomorphic to an arbitrary subset of 
I,, (X,d) defines a distance space. 

Given a distance matrix D g IR^ xAr on some finite discrete space X, the map- 
ping fo g is invertible whenever 3 : X x X — ¥ A is invertible. The distance measure 
on the constructed space is then given by /. 

Theorem 3.2. A distance space is uniquely defined by A. 
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Figure 3. Relation diagrams. 



By convention, (X, d) or A then denote an arbitrary distance space. It can also 
be shown that A e R, A e R 2 , A E R^ xN+1 and AelU R^ xN arc sufficient, 
under different assumptions. Any binary function that satisfies Equation 3.0.2 can 
form a distance space by imposing the following: 

(3.0.3) d(xi,Xj) = /'(/(x l ,x J ),/(x J ,x J )) 

for some function / that satisfies Equation |3.0.2| Other such measures have been 
used. A distance space embeds a collection of elements onto the real line and 
induces a poset topology and an order topology. 

Definition 3.3. Given some non-empty set X, a metric space (X, d) is a space 
equipped with a metric distance measure d defined using Equations |3.0.1| and 3.0.2 
with the additional property: 

(3.0.4) d(xi,x k ) < d(xi,Xj) + d(x jf x k ), x^x^XfeSX 

Metricity preserves heuristic notions such as distance minimization. Similarly, 
scmimctrics and ultrametrics can be defined by constraining or varying the standard 
properties. 

Definition 3.4. A norm ||-|| is a function / : X — >• A such that: 
(3.0.5) ||^ x «|| = 1^1 || Xi ll 

(3.0.6) pq+Xj < ||xj| + Xj 

(3.0.7) ||°||= A o 

for some zero vector 0, a zero element Ao, k £ K and Xj,xy G X. X then defines a 
normed space. 

Norms preserve metricity and linearity. Also, the notion of mapping of metric 
spaces is defined as follows. 

Definition 3.5. A mapping / : X -» Y of a space (X, d) into a space (Y, d') 
exists if the following holds: 

(3.0.8) T-d(xi,Xj) < rf'(/(x,),/(x J )) < k 2 d(xi,Xj) 

such that Xj, Xj € X, for some ki,k 2 > 1. 
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An isomorphic mapping is then defined as a (l,l)-distortion mapping. An iso- 
metric mapping then defines an isomorphic mapping on a metric space. Mappings 
of metric spaces can be generalized to mappings of arbitrary spaces using the same 
definition. 

On finite discrete spaces, mappings are given by the set of functions / : 
R+ xN -> R+ xN such that some parameters (fci,fe 2 ) hold. 

Whenever (k±, fc 2 )-distortions can be defined, the distance matrix of the image 
is said to be bounded by the distance matrix of the domain on some arbitrary 
function. In the context of similarity search, distortions usually refer to a lower- 
bounding distortion whenever k 2 < 1. An upper-bounding distortion is then given 
by k 2 > 1. 

In the context of compression, the usual metric is given on some vector space 

by: 

(3.0.9) k= ||/(x)-x|| 00 , xeX 

The distortion is then either (fc, k), [k%, k) or (k, k 2 ). 

Typical application domains of mappings include computational geometry, ap- 
proximation algorithms and functional analysis. 

Other problems of practical importance include mappings of random walks into 
the plane, mappings into binary codes and mappings onto the n-sphere. 

A finite discrete space is described in Definition |3.6| and refers to finite com- 
putable discrete spaces. 

Definition 3.6. Let / : X — » Y be a mapping from some arbitrary space 
(X, d) into a space (Y,d). Whenever the following holds for all y;, yj G Y: 

(3.0.10) ^d( yil y^) = 

given a cut metric defined as: 

,o nm r](v v i, A% 4 )nAA( yj )^0 

(3.0.11) dfo.y,)-! 0i A/"(y 4 ) nAftyj) = 

and some arbitrary neighborhood Af, (Y, d) defines a discrete space. When Y is 
finite, / generates a finite discrete space. (X, d) is then defined as the represented 
space. 

Corollary 3.7. A finite discrete space can be constructed if and only if its 
elements have an arbitrary neighborhood that can be computed exactly in the classical 
sense. 

Corollary 3.8. A countable subset of the real line containing arbitrary tran- 
scendentals is not a computable discrete space whenever the value of the transcen- 
dentals are considered. 

Proof. Let X = {xi, . . . , x„} define some space (X, d) for some transcenden- 
tals Xj. Given that the halting problem and the machine equivalence problem are 
undecidable, the following problem cannot be decided for arbitrary values: 

(3.0.12) AA(xj) njV(xj) = 

□ 
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Equivalently, Equation |3.0.11| cannot be decided on a classical machine for 
arbitrary subsets of the real line. 

Corollary 3.9. The problem of defining any discrete space can be decided 
upon whenever an arbitrary relation on all its elements can be decided. An arbitrary 
graph is then defined. 

Example 3.10. The space defined by: 

N 



(3.0.13) 



X = |J {tt + l/i, 1/i} 



is a computable finite discrete space under the following distance matrix: 



(3.0.14) 



D 



i = 1 
i + 1 

2N 
2N + 1 



i = l 




b 

a 

\a + b\ 



2N 



a 

\a-b\ 


h 



for some element a of an arbitrary alphabet and b given as follows: 
(3.0.15) ' ' 

such that o±i)£ R+ . 



i l + i 



Example 3.11. Let the following arbitrary curve be defined on the following 
sequence of iterates: 



(3.0.16) 



fi(x) = fi-i{x) + S 



for some infinitessimal operator S. Then, / defines a discrete space under the 
following distance matrix: 







fo(x) ... 




fo(x) 





(3.0.17) 








fi(x) 


••• 



Example 3.12. A discrete space can be defined on the following sequence of 
iterates: 



(3.0.18) 



fi(x) = Mx)+Sl5 
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for some uncomputable number Q 6 [0, 1] [2j. A distance matrix can then be given 
as: 




4. Distance Equations On Finite Discrete Spaces 



Some known notions often associated to search algorithms can be formalized 
using the distance equation concept. 

An example is similarity search in some n-dimensional space equipped with the 
£ p metric, which has found many interesting algorithmic solutions. 

Definition 4.1. Let XUqbe some finite discrete space. A distance equation 
is defined as follows: 



for all values of k £ K such that KC1. 

Corollary 4.2. Any distance equation on some finite discrete space defines a 
set of open or closed intervals. 

Corollary 4.3. A distance equation is defined on a continuous space if and 
only if k G K for some IC1, 

The complexity of defining a distance equation on a discrete space can then 
be seen as equivalent to the complexity of defining an equivalent distance equation 
on a continuous space whenever the corresponding representations of X are of the 
same order. 

A distance equation is then said to define a fc-partition of the real numbers. 

Definition 4.4. Two distance equations are isomorphic if they define two 
fc-partitions p, q such that \p\ = \q\. 

Definition 4.5. A composite distance equation is defined using standard set 
operations on distance equations: 



The solution of the equation is the subset of X that satisfies the equation and 
defines subrcgions of the distance space induced by the order topology. 
Whenever the set is viewed as a matrix, it is given by: 



Theorem 4.6. Let X be a finite discrete space, for any distance equation S 
defined on a computable distance function, X is well-defined. 



(4.0.1) 



<J(q) : rf(X,q) + fc = 



(4.0.2) 




(4.0.3) Xx = XX 

for some indicator function X. And more specifically, by absorption: 
(4.0.4) X z =2iXi, icN 
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Also, if 5 falls in the complexity class C whenever li can be computed in O(l), 
then it is customary to denote the resulting complexity class as C s . 

By extension, any low-complexity computation of 2^ denotes a complexity class 

r\ 

Let <5o(q) define a distance equation with a solution equal to an arbitrary el- 
ement chosen uniformly at random from the subset of X satisfying the adjoint 8 
equation. This is also referred to as the decision version of the search problem. 

The complexity of solving a distance equation 8 is equal to a multiple of the 
complexity of solving its corresponding <5o equation. The multiple is in the range 
[0,|X|]. 

4.1. Examples. The similarity search problem in some finite discrete space 
X equipped with the i\ metric is usually defined as follows: 

(4.1.1) %): ||X-q||i<e 

for some e £ R+. The same problem using £2 metrics is given as: 

(4.1.2) %): ||X-q|| a <e 

Similarly, the orthogonal range search problem can be defined as follows: 

(4.1.3) \JS(<ii) : ll X -*ll°° ^ 6 

i 

The random convex polytope search problem, also called the point location in 
arrangement of hyperplanes problem, is done on the discrete space X defined by 
the following arrangement: 

(4.1.4) \JA : A iq <bi 

i 

for some non-trivial matrices Aj with elements a* k and q,be W 1 . Equation 2.1.4 
can be translated to a distance equation such as: 

(4.1.5) |J S(q) : dimq + ]T BgQ*(A*^ - b)) = 

i 3 

It can also be noted that the sgn* function is here defined on the vertices of the 
n-cube HP as: 



(4.1.6) sgn* = 



1, x > 
-1, x < 



The solution of Equation |4.1.5| returns the indexes i of the discrete space X that 
solves the distance equation, or alternately the polytopes defined by (Aj, bj). 

It has been shown to be equivalent to the search formulation of a Jackson- 
Sheridan-Tseitin transform. 

Let (Y, d) be some finite discrete space such that X^QCY. Then, using the 
Hausdorff distance, the following distance equation can be defined: 

(4.1.7) \J8(Q)- l|Xj - Q\\ H +k = 

i 

for fee [ei, 62]. The Hausdorff distance is defined as usual: 

(4.1.8) ||X - Y|| H = max{sup xeX inf yeY rf(x,y), sup y6Y inf xeX rf(x, y)} 
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The overall complexity of Equations |4.1.1| |4.1.2| and |4.1.3| can be done, loosely, in 
at most (3(|X|dimX), where |X| denotes the cardinality of the set enclosing the 
space, following the usual assumptions. 



The overall comp lexity of Equation 4.1.7 is at most 0(|X||Q| dimX). On the 
other hand, Equation 4.1.4 is said to have a search complexity of 0(n 5 log N) I5l[sj . 



4.2. Classification, Equivalences and Dilations. 

4.2.1. Notation. For some distance measure d, a search problem denoted 5 on 
some finite discrete space X that can be solved using d is said to be in the class 
C(S, X, d). Also, let the dilation of an arbitrary mapping / of (X, d) into (Y, d') be 
given on the following relation: 

(4.2.1) — maxIX.J <:nmx|YJ < C2m ax|Xi| 

C\ i i i 

for some Ci, c-z > 1 and for all and for all subsets X; C X and C Y such that: 

(4.2.2) «5(qJ : d(X l ,q J ) + fc = 
and: 

(4.2.3) <5'(q0 : d'(Y u /(%)) + k' = 
and: 

(4.2.4) /(X,) C Y, 

If there is a (1, 0(l))-dilation, then C(S, X, d) is said to be equivalent to C(S' , Y, d'). 
The later is denoted as follows: 

(4.2.5) C(5,X,d)^C(S',Y,d') 
More specifically, ci is expected to grow as follows: 

(4.2.6) ^ = 1 + ^ 

C(S, X, d) is said to be probabilistically equivalent and bounded to C(S' , Y, d') when- 
ever dilations can be probabilistically bounded. This is denoted: 

(4.2.7) C(8,X.,d)^C(5,Y,d') 

k 

Two finite discrete distance spaces (X, d) and (Y, d') such that / : X — » Y are 
isomorphically embeddable if the following holds: 

(4.2.8) |Yi| = |Xi| 

for all and for all subsets X ; CX and Y C Y such that Equations |4.2.2[ |4.2.3| 
and |4.2.4"| hold. Given two distance measures d and d on some finite discrete spaces 
X and Y, if (X, d) embeds isomorphically into (Y, d'), then C(8, X, d) is said to be 
strictly equivalent to C(5', Y, d'). This is denoted as follows: 



(4.2.9) 



C(5,X,d) &C(5',Y,d') 



COMBINATORIAL SPACES AND ORDER TOPOLOGIES 



11 



4.2.2. Examples. Let the following equation be the general formulation of sim- 
ilarity search on some finite discrete metric space (X, d) [3jj5j : 

(4.2.10) S(q): d(X,q) < e 

with X C K n . Given some / : X — > Y, it follows that there exists finite discrete 
spaces Y for which Equation |4.2.10| is equivalent to is |4.1.1||4.1.2| and |4.1.3| The set 
of finite discrete spaces for which this is the case is such that the following holds: 

0(log|X|) = max l |^(/(q l ))nY| - 

4.2.11 

maxilB^q^HXl 

for all possible values of q € X, some £ p ranges fi™ of dimension n and radius e 
with distortion value k. Also, the following holds whenever a (k±, fc 2 )-distortion is 
defined: 

max, \B: (/(q,)) n Y| < max, |S £ " fcl ( qi ) n X| - 

(4.2.12) 



max; 



^(q.)nx 



Let / : M" — > H" be a mapping to the vertices of the n-cube defining the 
following nearest neighbor problem: 

(4.2.13) /(q) = {v e H" : minjft - Vi \\} 

The mapping / chooses the vertex of EP that is closest in terms of component-wise 
distances. The point location in arrangement of hyperplanes problem described by 
Equation |4. 1.5 can then be rewritten as follows: 

(4.2.14) <J(q): dim q + dim qcos (/(q), 1) = 

Therefore, Equation |4 . 1 . 5| is strictly yet perhaps trivially equivalent to a linear func- 
tion of the cosine distance, given some prior nearest-neighbor mapping. Equivalence 
to £ p can be shown if the following holds: 

(4.2.15) max||X- q|| < k 

for some constant k. The cosine distance is then strictly equivalent to a bounded 
£ p metric, denoted £ p , which in turn defines a subset of R™ of finite radius. It is 
also metrizable by using a convergent sequence that will yield an integral over a 
domain given by the range of the distance space. The range is then unbounded and 
cosine distances can be said to be strictly equivalent to l v metrics for finite discrete 
spaces. One such mapping for finite vector spaces that can serve this purpose is 
the limit: 

,tan 2 e/2 || x _ v ||l/ 2 

(4.2.16) cos(x,y)=/ " „ y|lp „ d||x-y|| p 



whenever Equation 4.2. 16| generates a computable distance space. As the differ- 
ential element tends to infinity, the integral yields the range of the cosine. The 
formulas provided by S. Ramanujan for the computation of tt can also serve as an 
example to the metrization of directional distance spaces. 
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4.3. Classic Theorems On Discrete Spaces. The class of distance equa- 
tions that are equivalent or strictly equivalent are tightly coupled with the prop- 
erties of their adjoint discrete spaces. Known results can be shown in finite vector 
spaces from the literature on the subject (SI. 

Theorem 4.7. In a finite vector space, all norms are equivalent given some 
mapping f : X — > Y. 



Theorem 14.71 is denoted: 

(4.3.1) cos.xj.y^c^Yj.y 

Corollary 4.8. Given any distance measures d and d', there exist finite dis- 
crete spaces X and Y such that: 

(4.3.2) C(8 ) X,d)4>C(5,Y,d') 

Proof. This can be trivially shown by simple linear manipulations of d and 
d' on discrete spaces of cardinality two. □ 

Theorem 4.9. (51 Let the cut metric be defined on some subset X C Y as 
follows: 

^ d ^ = {l] H^rtis^ 

Then, some distance metric d is such that: 

(4.3.4) C(S,Y,d)^C(S,Y,\\-\\ 1 ) 
if and only if d can be expressed as: 

(4.3.5) d= c 2 d(X,q) 

XCY 

for some nonnegative and non-zero coefficients Ci. 

Theorem 4.10. Given an arbitrary distance measure d, a metric d' and some 
mapping f : X Y ; the following holds for some finite vector spaces X C K™ and 
Y c K m with m = (n,3): 

(4.3.6) C(5,X,d) ^C(S,Y,d') 

Corollary 4.11. Given an arbitrary distance measure d and a metric d' some 
finite vector spaces X, Y C R" and some mapping / : X — > Y the following holds: 

(4.3.7) C(<5,X,d)«*C(*',Y,tO 

Theorem 4.12. Any finite metric space (X,d) is probabilistically equivalent 
and bounded to (Y, || • || p ). This is denoted: 

(4.3.8) C(d,X,d)->C(6,Y,\\.\\ p ) 

k 

for some X C K n and Y C M° (log |x|) . 

Theorem |4.12| is such that the relationship between distortions and dilations 
for the similarity search equation can be given by Equation |4.2.12| Such distortions 
include low-dimensional subset mappings and random projections. 
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4.4. Classic Theorems On Subspaces and Subsets. Known theorems on 
the representation of distance equations over vector spaces as distance equations 
over subspaces and subsets of vector spaces can be defined from the literature on 
the subject. 

Theorem 4.13. [4] Let (X, d) be a finite discrete metric space such that X C 
K™ and let Y be a linear subspace of X defined as follows: 

(4.4.1) ||yi||o = fc, y,£Y 
and: 

(4.4.2) y 2 -Xx 2 , jqj.exj 

for some constant k < n and an indicator function I. Then the following holds: 

(4.4.3) d(xi,x 2 )>d(yi,y 2 ), (xi, x 2 ), (y X) y 2 ) G X, Y 

Corollary 4.14. 7 Whenever the indicator functions Ti describes a contigu- 
ous subset of components, the following holds: 

(4.4.4) %)c(J«5'(Z iq )) 

i 

given S and S f defined as follows: 

(4.4.5) %) : ||X-q|| p <e 

(4.4.6) |J 5'(lq^) : ||2X-2,-q|| < 0(n a e) 

i 

for O(nfk) indicator functions and a constant a > 0. 



Theorem 4.15. 11 Let XUqC K™ be some finite discrete space, there exist 



f, g : 1" — > K such that the following holds: 

(4-4.7) IJ^^U^U^)) 

i i,j 

given 5 and 6' defined as follows: 

(4.4.8) \j8(<li): HX-qilloo^e 

i 

(4-4.9) |J 5'ifMid))-- ls(X) - ^(q^-) | = 

Theorem 4.16. Let f : X — > Y be a distortionless mapping given by a binary 
matrix G as follows: 

(4.4.10) /(X) = GX 

for some x G X. Then there exists G such that following holds for yi,Vj G /(X) 
and a constant e: 

(4.4.11) ||y 4 -y,||i>£, y^y, 

X and Y can be chosen such that X C B", Y C B™. 
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4.5. Characteristic Radius and Encodings. As a converse to the notion of 
error-correcting bounds, a characteristic radius can be denned on any finite discrete 
distance space. 

Definition 4.17. Given an isomorphic mapping / : X — > Y, the actual char- 
acteristic radius of / is the minimal radius of Y in the distance space (Y, d). 

This is given by: 

(4.5.1) r = mind(/(x i ),/(x i )) 
for all Xi, Xj € X. 

Given a normed finite discrete space X, the compactness c is then defined as: 

(4.5.2) c(X) = ||X|| 0O -r 

The set of all isomorphic mappings of some finite discrete space is equivalent to 
the set of all injective mappings g : / — > xN that are order invariant given some 
distance function / iXxX^if" For some continuous function f(x,y,z), 
this can be seen as the smoothing process given by: 

(4.5.3) f(z)=g°f(z) 

for some value z defining an arbitrary cutting plane. Or alternatively: 

(4-5-4) r X -ffo/- l | /( . irt =0 
which equivalently defines a projection to identity of / and g into 



Theorem 4.18. Given some discrete bounded space, a computable characteris- 
tic radius can be defined. 

Corollary 4.19. An unbounded discrete space has an actual characteristic 
radius of oo. 

Corollary 4.20. Any unbounded discrete space (X,<i) can be bounded through 
some mapping f : (X, d) — > (Y,e?'). 

Also, most practical results apply to some general finite discrete space or some 
non-trivial class of such spaces. As the characteristic radius decreases, the com- 
plexity of the encoding is said to increase. The complexity is then given by function 
of the compactness c(X). 

For some finite discrete space in R, a few additional results can be obtained. 

Theorem 4.21. Any finite discrete space X C M has a characteristic radius 
upper bounded by: 

(4.5.5) logHXIU 

Corollary 4.22. For some finite value n, any code X defined in N n has a 
minimal descriptive length upper bounded by 0(log ||X||oo). 

These results will be further discussed and shown in subsequent sections. Also, 
Corollary |4.22 can be seen as an information theoretical or algorithmic complexity 
bound on normed finite discrete spaces. Such bounds are directly proportional to 
the characteristic radius. A characteristic radius then qualifies how discrete a space 
is. It defines a compressibility coefficient and by extension a computability order. 
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Figure 4. Distinguishability on the line. 

4.6. Distinguishability On The Line. Let / define some function from 
some finite discrete space to a finitely computable subset of the real line. The 
distinguishability problem is then given equivalently to the derivation problem on 
the real line: 

(4.6.1) f x = k 

whenever Equation generates a finite computable set. 

It can be shown that the set of functions / that satisfies the distinguishability 
problem are given by: 

(4-6.2) f x - f- 1 = k 

whenever the equation is finite. 

Given such a function /, reducing the complexity of the distinguishability prob- 
lem can be done by finding the set of order preserving mappings of / given by: 

(4.6.3) 



whenever / is computable. 



Definition 4.23. Let / be a norm such that |4.6| holds, then / can be termed 
a distinguishing norm. 

The definition of discreteness in terms of Equation |4.6| can then be restated as 
follows. 

Theorem 4.24. Let f : X — >• Y be an asymptotically finite computable func- 
tion, then X is a discrete space. 

The complexity of / defines the computable radius of the discrete space which 
is conceptually equivalent to the characteristic radius. 



5. Poly logarithmic Reduction In Finite Space 

Let the g-number G^ of a sequence of N integers be defined on some factorial 
domain. 

Definition 5.1. A g-code denotes and by extension defines the set of reversible 
functions over some factorial domain. 
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A g-code uniquely defines an arbitrary sequence and is equivalent to the set 
of discrete spaces that can be encoded on a classical computer. Other than its 
celebrated use in logic and theoretical computer science, the uniqueness property 
of g-codes has been exploited in hashing algorithms. From the perspective of infor- 
mation and quantity, g-numbers typically have large logarithmic norms. 

Example 5.2. The problem of finding the minimal number of bits required to 
encode a given sequence of elements from some finite discrete space using (/-codes 
is directly related to known information theoretical bounds. Also, polylogarithmic 
bounds can be specified. This is shown for the particular cases of G N , G Nl Gn ' , 
G 32 at and G 64 tv and their associated distance spaces. 

Example 5.3. The general problem of determining the compressibility of a 
sequence has been shown to be uncomputable. More specifically, the algorithmic 
complexity of a sequence can be reduced to the Halting problem. From an infor- 
mation theoretic point of view, some results show that random sequences cannot 
improve on the logarithmic scale. Provided specific encodings and distributions, 
optimal bounds can be effectively be computed. By extension, while exact lower- 
bounds are in general uncomputable, known upper-bounds have been provided in 
the literature on the subject. 

Example 5.4. Polylogarithmic upper-bounds can be provided to the compress- 
ibility problem using integral prime g-codes. The encoding and decoding of g-codes 
requires the rapid resolution of the constrained prime factorization problem on 
reasonably integers while the problem of factoring a large integer on a logarithmic 
scale can be related to the problem of deciding efficiently the integral indeterminate 
polynomial equation given by: 

(5.0.1) x - T = 

for some i£N and a € R+. Whenever a is a rational, the problem can be solved 



trivially. The decision problem of Equation 5.0.1 is then to determine if an arbitrary 
power of two accepts an integer solution and corresponds to the integrals solutions 
of the base two function on a real domain. 

Also, it is known that the general solvability of integer-valued indeterminate 
polynomial equations is an undecidable problem. Also, in its simplest expression 
given by Equation |5.0.1| the problem is very decidable. 

An /-code is typically defined as follows: 

Definition 5.5. The 1- number of L^, of a sequence of cardinality TV is usually 
defined as: 

(5.0.2) L n N = f(G n N ) 

for some funtion /. 

In general, an g-code on some finite discrete space X can be defined as: 

(5.0.3) Ljy = f(G n N ) 

such that r(Ljv) = O (1°S I|Gjv||) f° r some given norm. 

Theorem 5.6. L 7 ^ defines a finite discrete space equipped with a characteristic 
radius. 
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5.1. L-Codes. From the perspective of compression, common results for g- 
codes can be discussed for ASCII, UTF, 32-bit encodings and 64-bit encodings. 

Theorem 5.7. Any prime bitwise g-code has a decoding complexity of O(N) 
using trial algorithms. 

A bitwise <?-code can then be denoted G^. In general, the following complexity 
result can be stated. 

Theorem 5.8. Given some basis f(N) € K. N , a g-code G 7 ^ has a size complex- 
ity upper-bounded by 0(log ll/O^OHIMloo)- 

An l-code on ASCII codes can be denoted as L%. The equivalent /-codes for 
UTF, 32-bit encodings and 64-bit encodings are respectively L^ 6,20 \ L^ and L^. 

Example 5.9. Let an l-code L^ of length A on an n-dimensional integer code 
be defined as in Equation |5.0.3| for / defining the logarithmic and inverse functions, 
a typical ordering of the Z-code is then given by, for N = kn: 

(ax, ... ,a„), ((ax, ... ,a„) + 1) mod A, ... , 
((ax, ■ ■ ■ , a„) + N/n - 1) mod A 

Example 5.10. The ordering of the linear Z-codes L]^ = ^ \ogpi and L]^ — 
^ a" 1 log ^ are given by: 

(&!,..., b 16 ) =(1, 17, 33, 49, 65, 81, 97, 113, 129, 145, 
'- 5 ' 1 ' 2 '' 161,177,193,209,225,241) 

(bx, . . . , b 1G ) =(241, 225, 209, 193, 177, 161, 145, 129, 

'- 5 ' 1 ' 3 '' 113,97,81,65,49,33,17,1) 

An order curve is then defined as the ordering of some factorial domain function 
on an indicator function. The order curve of L^(I) is shown in Figure [5] 



Example 5.10 defines an oriented lattice. Similar orders can be defined in 
— 0\\oo- The lattices of Example 5.10 have orientation 1 and -1 respectively. 



5.2. Jackson-Sheridan-Tseitin Transforms. Let S be a finite discrete space 
defining a Jackson-Sheridan-Tseitin transform [6 10 such that Si < . . . < s n and: 

(5.2.1) S'=Z(S®l a ») 

It can be shown that the order set of L^r(I) generates a quasi-lattice for N = 2 n . 
Let the quasi-lattice L^(X) be denoted f(x) and let the following order curve be 
defined as A[/] i (x) for some constant h = k. Then the following can be shown: 

(5.2.2) A[fUx) = ? + ? +1 (i-l) 
Or alternatively: 

(5.2.3) i = k mod 2 k 

Let S^ denote the order curve on L^(I) such that: 

(5.2.4) S^=L^(X) = L^(X(A®1 2JV )) 
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S^f can be denned as follows: 



N 



(5-2-5) ^ 



S n = La*/( s *) lo g^Jio g 
»=i 

N 

E^ lo g^J log + AS^(I) 



for some function /(sj), Oj € {0, 1}, Sj G S and a logarithmic code floor function. 
Pi then defines an arbitrary set of ordered primes. 

Remark 5.11. A set of good constants are such that /(sj « 1, In general, 
a set of good constants are such that ~ fcj.j,... such that |k| is bound 

(Equation [5. 2. 10| ). 

Following this remark, whenever /(s 2 ) ~ 1, the order of AS^r is equal to the 
order of L^, for some 1-function such that: 

(5.2.6) r(AS^) = o(||L^(AS^)|| 00 ) 
This is denoted: 

(5.2.7) Or(AS^) = Or(L^) 

Theorem 5.12. An ordered bounded space of cardinality 2™ can be searched in 
0(nlog2) whenever partial order can be decided in 0(1). 

Theorem 5.13. Let Or(L 7 ^) — Or{S^) on some function f defined as in Equa- 
tion ^. 275 . and let the I- code S^r be defined such that the following holds: 

(5.2.8) Or(L n N (n(0, 1, 2, 3, ... 2" - 1) 2 )) = Or(^) 

A set of constants /(cjSj) = Cif(si) can be found with space complexity Ci — 
OOogllSIU + nlogfc). 

Corollary 5.14. Let X = {xi, . . . ,Xjv} be some finite discrete space with 
order 0r(X). An order preserving function is such that for F^j = /(Xj), there 
exists f : F -> K N such that Or(f(X)) = Or{L n N {l)). 

Remark 5.15. Let (S/T, 1) be an arbitrary Jackson-Sheridan-Tseitin trans- 
form with solutions T s . A general form for distance equations that retrieves a set 
(L^) s is: 

n 

(5.2.9) \j5{q = A i ):d(q,B itj ) = l 

i=l 

such that j i and j el s . And: 

n 

(5.2.10) |J 6(q = A,): d(q, ./'•: I '•,,.( • . . . , %...)) = 1 
i=l 

such that . . . 7^ j 7^ i and j,... el s . Whenever a set Cjj of constants can be found 
such that Remark 15. Ill or Theorem 15.131 hold, the solution is said to be efficient. 
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Figure 5. Order curve of L%(X). 



Example 5.16. Starting from the ||S|| + ||S||i = 0(2KM (K + M)) Jackson- 
Sheridan-Tseitin representation, the problem can then be stated as such [6| |10] : 

(5.2.11) S' =1 (L 2 K+M (S) ® 1 2 ikm_ x ) 

such that Sij £ {0, 1} and <E S. Three i-codes can be constructed. The first one, 
on the decimal binary representation of S: 

K+M 

(5.2.12) I4 +M (S)= ]T L^logpdiog 



A second one on /(S'): 



2KM 



(5.2.13) 



J 2KM ' 



such that: 

(5-2.14) c& 3 K+M {Si) («,>) 1 

This satisfies Remark |5.11| and a candidate set of constants Cj can then be given as: 



(5.2.15) 



J K+M 



(Si))" 



0((JsT + M)l>«ioJ; 



log 



A third ^-code on S' is given by: 
(5.2.16) ^1 K km + ( k + m)^')= E IMogft, h&[0,k]\ los 



K+M 



In practice, rational- valued polylogarithmic functions can be used. 
The following properties hold on the order curves: 

(5.2.17) Or(L^ +M (S)) cOr(L»(Z)) 
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(5.2.18) Or {L 2 2KM (f(S')) = Or (L| +M (S)) 

(5.2.19) Or(q™ +M) )=A/B 

such that A = {0, . . . , l\i=i Pi} and B are the numbers factoring to at least a Pj 
such that j ^ i. The following cardinalities can then be computed: 

|A| = (K + M + l) K + M 
(5-2-20) ' 

|B \=n-{K + M + l) K+M 



The distribution of Or (J-^km^k+M)) behaves as follows: 

, x AOr(L^ +M) (S')) 
(5 - 2 - 21) ajjfc- AL 2KM f jW) = 1 

Given Equation 5.2.21 as the problem grows larger, the sparsity of the order curve 
decreases. Furthermore, the distributions of finite differences has a factorial growth 
complexity, It can then be said that the space complexity of the associated /-code 
is given by: 

(5-2.22) \lIkm(k + m)&)\ = 0(*log||p|U) 

Following this, a general form of an associated distance equation is given by: 

2KM 

(5.2.23) (J S(q = A t ) : q-S iJ =T 

i=i 

An actual equation can be given by: 

K+M K+M 
i£Z s i— 1 j^i k—1 

(5.2.24) ^ex, 

K M 
t=l i=K+l 

Alternatively in Z-code notation: 

(5.2.25) U S (ti : 9+ a 2»f(K+M)( I ) =T 

Since the zero distance never occurs, Equation |5.2.23| can be termed a distance 
equation. The approximation factor can be set to equality using Theorem |5.22| 

Property 5.17. A totient function can be given to count the elements in a 
prime or log-prime order: 

(5.2.26) <t>{n,p) =«[[ ( 1 - ^ 
such that gcd{n,pi) 1. 



log 



p, 



In practice, arithmetic progressions or some arbitrary ordered sequence of 
primes are sufficient, or alternatively, a set of p-adic numbers. 

This result can be generalized to factorial domains as defined in Definition [L6] 
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Property 5.18. The following on ordered primes holds: 

(5.2.27) J2 l °SP* < Yl lo S^"' 1 C J 

Remark 5.19. Let the following sequence of iterates be defined on the order 
given by Equation [5~2~19l 

(5.2.28) fi=x, xtOr(L™%$ +M) ) 

It can be shown that log i log^ fi is an inverse sigmoidal function. 

Following Remark |5.19| a random sample at the extremities of the order can 
be used to approximate the value log^og^. From the approximated value, the 
lowest-nearest value to the sequence of iterates can be chosen. 

Theorem 5.20. Or(VJj(T)) can be computed using 0(\ogN) recursions. 

Theorem 5.21. On a logarithmic factorial prime domain, the space complexity 
of a finite discrete space X is upper bounded by: 

(5.2.29) OOogH-XlU) 

Theorem 5.22. A code of cardinality k in some distance space is reversible if 
it provides a k-partition. 

The order on the Z-codes of some indicator function can be computed without 
the knowledge of the associated Z-codes. The indicator function defines a space 
of complexity 0(2") or 0(n n ). The complete search can then be done in approx- 
imately 0(n log 2) and O(nlogn) respectively. A relaxation of the problem only 
requires the knowledge of the number of elements before and after a given element 
in the l-code. This can be done in 0(1). The upper-bound provided on the finite 
difference of the class of integral prime Z-codes then provides a bound on the space 
complexity of the coding. 



5.3. Further Remarks on g-Codes and 1-Codes. 

5.3.1. Iterated Encodings. 

Definition 5.23. Let n be the length of a code, N, the number of codes in a 
(g,l)-code, i, the number of iterations and j the number of (g,l)-codes in a given 
iteration, an iterated encoding is then recursively defined as follows: 

(5-3-1) p n N = )\G n N \ oUKc.oUjGl; 

And similarly: 

Iterated encodings preserve the complexity of a single iteration whenever the 
number of iterations is 0(1). This is denoted: 

(5.3.3) ECDC = O (eCDC (g^^)) 
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5.3.2. Limiting Encodings. An infinite sequence is incompressible using g-codes, 
which follows the classic argument in algorithmic complexity that shows that the 
problem of defining such a code is in general uncomputable. The logarithmic upper- 
bound of Z-codes follows the classic information theoretical argument. 

The basic results of algorithmic complexity can be restated as such for codes 
and sequences. 

Theorem 5.24. An infinite code may or may not have a finite algorithmic 
encoding. 

Theorem 5.25. The number of infinite codes that have a finite algorithmic 
encoding is finite. 

Theorem 5.26. The problem of determining the existence of a finite algorith- 
mic encoding for an infinite code is decidable. 

In the context of finite discrete spaces, the main result is as follows. 

Theorem 5.27. Finite codes have an algorithmic encoding upper-bounded by 
log N where N is given by the norm. 

5.3.3. l-Functions. An 1-function is defined as a mapping from an arbitrary 
encoding to a set of i-codes. Formally: 

Definition 5.28. Let 1 be an 1-function. The following then holds: 

(5-3.4) /(X) = {-L-,...,-L-} 

Reversibility and uniqueness properties can be discussed in the context of 1- 
functions. 

Definition 5.29. An 1-function is said to be reversible if and only if the fol- 
lowing holds: 

(5.3.5) /(X) = Ufc(X fc ) 

k 

for some bijective function / defining a unique Z-code. 

5.3.4. Complete l-Functions. Complete encodings define as usual the set of en- 
codings that are Turing complete. 

Theorem 5.30. The following sequence of iterates illustrates a Turing machine: 

(5.3.6) Xj = x,_i +Ti,i = 1, . .. 
with the following: 

(5.3.7) [>(x fc ) = |J^( x ) 

k k 

The Z-code representation of a universal encoding is denominated a complete 
1-function. Theorem |5.30| provides a class of simple 1-functions built on the permu- 
tation space of an algorithmic encoding. 
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5.3.5. Application of I- Function on (1,0). Let some function / define a sorting 
algorithm known as bead sort [l] on N random integers B in their matrix rank 
order representation such that: 



M,l 



(5.3.8) |J fi(b jtk) b j+ i, k ) = 

k,j=l,N 



bj+l,k 

bj,k 



bj,k 
"j,k+l 



, (b j<k ,b j+1<k ) (1,0) 



, otherwise 



and: 



(5.3.9) 



B = 



bi.2 

&JV.1 



bN,M 



with b it j e {0, 1}. 

Then the following representation can be used: 

(5.3.10) 



Equation [5XT0l can be denoted as the /^-interpolation of / and defines the upper- 
bound of on the following complexity space: 





0(f) 


0(|/|) 


(5.3.11) 


0(f) 


0(N\) 




0(N) 


0(1) 



This is shown in Figure [6j 
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